Cue Phrase Classification Using Machine Learning

Author

  • Diane J. Litman
Abstract

Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. This paper explores the use of machine learning for classifying cue phrases as discourse or sentential. Two machine learning programs (cgrendel and C4.5) are used to induce classification models from sets of pre-classified cue phrases and their features in text and speech. Machine learning is shown to be an effective technique for not only automating the generation of classification models, but also for improving upon previous results. When compared to manually derived classification models already in the literature, the learned models often perform with higher accuracy and contain new linguistic insights into the data. In addition, the ability to automatically construct classification models makes it easier to comparatively analyze the utility of alternative feature representations of the data. Finally, the ease of retraining makes the learning approach more scalable and flexible than manual methods.
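
As a rough illustration of what inducing a classification model from pre-classified cue phrases can look like, the sketch below learns a single-feature rule from labeled examples. It is not cgrendel or C4.5: a much simpler one-rule (1R-style) learner is swapped in for brevity. The feature names (`utterance_initial`, `preceded_by_punctuation`), the toy examples, and the function names are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

# Toy, hand-made training examples (hypothetical; NOT data from the paper).
# Each example pairs a feature dictionary with a label:
# "discourse" or "sentential".
EXAMPLES = [
    ({"utterance_initial": True,  "preceded_by_punctuation": True},  "discourse"),
    ({"utterance_initial": True,  "preceded_by_punctuation": False}, "discourse"),
    ({"utterance_initial": False, "preceded_by_punctuation": True},  "sentential"),
    ({"utterance_initial": False, "preceded_by_punctuation": False}, "sentential"),
    ({"utterance_initial": False, "preceded_by_punctuation": False}, "sentential"),
    ({"utterance_initial": True,  "preceded_by_punctuation": True},  "discourse"),
]


def induce_one_rule(examples):
    """Return (feature, value->label rule, training errors) for the single
    feature whose majority-label rule makes the fewest training errors."""
    best = None
    for feature in sorted(examples[0][0]):
        # Count labels for each observed value of this feature.
        counts = defaultdict(Counter)
        for feats, label in examples:
            counts[feats[feature]][label] += 1
        # Map each feature value to its most frequent label.
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        errors = sum(rule[feats[feature]] != label for feats, label in examples)
        if best is None or errors < best[2]:
            best = (feature, rule, errors)
    return best


def classify(model, feats, default="sentential"):
    """Apply the induced rule; fall back to `default` for unseen values."""
    feature, rule, _ = model
    return rule.get(feats[feature], default)


MODEL = induce_one_rule(EXAMPLES)
```

Because the model is rebuilt by calling `induce_one_rule` again, swapping in a different feature representation or new labeled examples only requires changing `EXAMPLES`, which is the kind of retraining convenience the abstract attributes to learned models.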


Similar resources

Cue Phrase Classification Using Machine Learning

Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. T...

A spoken language system for automated call routing

We are interested in the problem of understanding fluently spoken language. In particular, we consider people's responses to the open-ended prompt of 'How May I help you?'. We then further restrict the problem to classifying and automatically routing such a call, based on the meaning of the user's response. Thus, we aim at extracting a relatively small number of semantic actions from the utteranc...

Cue Phrase Selection In Instruction Dialogue Using Machine Learning

The purpose of this paper is to identify effective factors for selecting discourse organization cue phrases in instruction dialogue that signal changes in discourse structure such as topic shifts and attentional state changes. By using a machine learning technique, a variety of features concerning discourse structure, task structure, and dialogue context are examined in terms of their effectivene...

Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data

A reliable and precise classification of tumors is essential for successful diagnosis and treatment of cancer. cDNA microarrays and high-density oligonucleotide chips are novel biotechnologies increasingly used in cancer research. By allowing the monitoring of expression levels in cells for thousands of genes simultaneously, microarray experiments may lead to a more complete understanding of the...

The Error Coding Method and PICTs

A new family of plug-in classification techniques has recently been developed in the statistics and machine learning literature. A plug-in classification technique (PICT) is a method that takes a standard classifier (such as LDA or TREES) and plugs it into an algorithm to produce a new classifier. The standard classifier is known as the Base Classifier. These methods often produce large improvements ...

Publication date: 1996